Large-scale microbiome data integration enables robust biomarker identification
نویسندگان
چکیده
Abstract The close association between gut microbiota dysbiosis and human diseases is being increasingly recognized. However, contradictory results are frequently reported, as confounding effects exist. lack of unbiased data integration methods also impeding the discovery disease-associated microbial biomarkers from different cohorts. Here we propose an algorithm, NetMoss, for assessing shifts network modules to identify robust associated with various diseases. Compared previous approaches, NetMoss method shows better performance in removing batch effects. Through comprehensive evaluations on both simulated real datasets, demonstrate that has great advantages identification disease-related biomarkers. Based analysis pandisease studies, there a high prevalence multidisease-related bacteria global populations. We believe large-scale will help understanding role microbiome more perspective accurate biomarker greatly promote microbiome-based medical diagnosis.
منابع مشابه
Robust Classification of Protein Variation Using Structural Modeling and Large-Scale Data Integration
Existing methods for interpreting protein variation focus on annotating mutation pathogenicity rather than detailed interpretation of variant deleteriousness and frequently use only sequence-based or structure-based information. We present VIPUR, a computational framework that seamlessly integrates sequence analysis and structural modeling (using the Rosetta protein modeling suite) to identify ...
متن کاملRobust classification of protein variation using structural modelling and large-scale data integration
Existing methods for interpreting protein variation focus on annotating mutation pathogenicity rather than detailed interpretation of variant deleteriousness and frequently use only sequence-based or structure-based information. We present VIPUR, a computational framework that seamlessly integrates sequence analysis and structural modelling (using the Rosetta protein modelling suite) to identif...
متن کاملIntegrated Robust Identification and Control of Large-Scale Processes
We propose the use of pseudo-singular values, which are closely related to singular values but are allowed to have sign, as a convenient approach for developing techniques for the identification and control of large-scale processes. Steady-state controllability can be assessed directly in terms of the pseudo-singular values. It is shown that to control an output disturbance direction with zero ...
متن کاملMicrofluidic large-scale integration.
We developed high-density microfluidic chips that contain plumbing networks with thousands of micromechanical valves and hundreds of individually addressable chambers. These fluidic devices are analogous to electronic integrated circuits fabricated using large-scale integration. A key component of these networks is the fluidic multiplexor, which is a combinatorial array of binary valve patterns...
متن کاملAn Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity
The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Nature Computational Science
سال: 2022
ISSN: ['2662-8457']
DOI: https://doi.org/10.1038/s43588-022-00247-8